Web-page recognition algorithm estimating text coherence
نویسندگان
چکیده
منابع مشابه
Estimating the Rate of Web Page Updates
Estimating the rate of Web page updates helps in improving Web crawler’s scheduling policy. But, most of the Web sources are autonomous and updated independently. Clients like Web crawlers are not aware of when and how often the sources change. Unlike other studies, we model the process of Web page updates as non-homogeneous Poisson process and focus on determining localized rate of updates. Th...
متن کاملA Score based Web Page Ranking Algorithm
With the explosive growth of information in the Web, users face difficulties while finding their desired information. Search engine helps the user by retrieving useful information from this huge collection based on his/her search query and presents a list of relevant web pages as a search result. However, without proper ranking of pages in the result through the relevancy of pages to the search...
متن کاملA Marker Propagation Algorithm for Text Coherence
Text coherence is a di cult problem in natural language processing A text is considered to be coherent when sentences follow logically one after the other In this paper we describe a computational method that provides an explanation why a text is coherent By providing such an explanation one can infer a number of assertions unstated in a text Our computational method is based on a parallel mark...
متن کاملA Novel Approach to Feature Selection Using PageRank algorithm for Web Page Classification
In this paper, a novel filter-based approach is proposed using the PageRank algorithm to select the optimal subset of features as well as to compute their weights for web page classification. To evaluate the proposed approach multiple experiments are performed using accuracy score as the main criterion on four different datasets, namely WebKB, Reuters-R8, Reuters-R52, and 20NewsGroups. By analy...
متن کاملWeb Page Ranking Based on Text Substance of Linked Pages
World Wide Web is large sized repository of interlinked hypertext documents accessed via the Internet. Web may contain text, images, video, and other multimedia data. The user navigates through this using hyperlink. Search Engine gives millions of results and applies Web mining techniques to order the results. The sorted order of search results is obtained by applying some special algorithms ca...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: On-line Journal "Naukovedenie"
سال: 2015
ISSN: 2223-5167
DOI: 10.15862/71tvn115